Robust speech recognition against packet loss
نویسندگان
چکیده
Recognizing speech transmitted over mobile or computer networks poses new challenges such as packet loss in transmission. Viterbi algorithm, the most common speech recognition approach, searches for the most likely state sequence that explains all observation. However, because it implicitly sums the log observation probabilities, the resulting solution is sensitive to outlier frames. In this paper, we propose a robust approach that searches the state sequence that best explains x percent of the observation and is insensitive to the corruption of a limited number of frames. We evaluated the proposed algorithm on the TIdigits task. With 10% of the data loss, the proposed algorithm achieves improvement of WER 71.6% for isolated digit recognition and 32.2% for connected digit recognition.
منابع مشابه
Robust speech recognition over packet networks: an overview
Conventional circuit-switched networks are increasingly being replaced by packet-based networks for voice communication applications. Additionally, there has been an increased deployment of services supporting speech based interactions. These trends demand reliable transmission of speech data not just for playback but also to ensure acceptable automatic speech recognition (ASR) performance. In ...
متن کاملGraceful degradation of speech recognition performance over packet-erasure networks
This paper explores packet loss recovery for automatic speech recognition (ASR) in spoken dialog systems, assuming an architecture in which a lightweight client communicates with a remote ASR server. Speech is transmitted with source and channel codes optimized for the ASR application, i.e., to minimize word error rate. Unequal amounts of forward error correction, depending on the data’s effect...
متن کاملRobust Model for Networked Control System with Packet Loss
The Networked Control System in modern control widely uses to decrease the implementation cost and increasing the performance. NCS in addition to its advantages is inevitable. Nevertheless they suffer of some limitations and deficiencies. Packet loss is one of the main limitations which affect the control system in different conditions and finally may lead to system instability. For this reason...
متن کاملRobust Speech Recognition Aga
This paper develops a robust speech recognition algorithm against short-time noise, of which no prior knowledge of their spectral characteristics is known. However, we assume that these noises only affects certain part of the speech and are also known as partially temporal corruption in [1]. Examples of these short-time noises include door slam, click sound of keyboard or packet loss in network...
متن کاملNoise-Robust speech recognition of Co
Over the past several years, the primary focus of investigation for speech recognition has been over the telephone or IP network. Recently more and more IP telephony has been extensively used. This paper describes the performance of a speech recognizer on noisy speech transmitted over an H.323 IP telephony network, where the minimum mean-square error log spectra amplitude (MMSE-LSA) method [1,2...
متن کامل